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ABSTRACT 

We have analyzed the clustering of ~ 90, 000 narrow-line AGN drawn from the Data 
Release 4 (DR4) of the Sloan Digital Sky Survey. Our analysis addresses the following 
questions: a) How do the locations of galaxies within the large-scale distribution of 
dark matter influence ongoing accretion onto their central black holes? b) Is AGN 
activity triggered by interactions or mergers between galaxies? We compute the cross- 
correlation between AGN and a reference sample of galaxies drawn from the DR4. We 
compare this to results for control samples of inactive galaxies matched simultaneously 
in redshift, stellar mass, concentration, velocity dispersion and mean stellar age, as 
measured by the 4000 A break strength. We also compare near-neighbour counts 
around AGN and around the control galaxies. On scales larger than a few Mpc, AGN 
have almost the same clustering amplitude as the control sample. This demonstrates 
that AGN host galaxies and inactive control galaxies populate dark matter halos of 
similar mass. On scales between 100 kpc and 1 Mpc, AGN are clustered more weakly 
than the control galaxies. We use mock catalogues constructed from high-resolution N- 
body simulations to interpret this anti-bias, showing that the observed effect is easily 
understood if AGN are preferentially located at the centres of their dark matter halos. 
On scales less than 70 kpc, AGN cluster marginally more strongly than the control 
sample, but the effect is weak. When compared to the control sample, we find that 
only one in a hundred AGN has an extra neighbour within a radius of 70 kpc. This 
excess increases as a function of the accretion rate onto the black hole, but it does 
not rise above the few percent level. Although interactions between galaxies may be 
responsible for triggering nuclear activity in a minority of nearby AGN, some other 
mechanism is required to explain the activity seen in the majority of the objects in 
our sample. 

Key words: galaxies: clustering - galaxies: distances and redshifts - large-scale struc- 
ture of Universe - cosmology: theory - dark matter 



1 INTRODUCTION 

A major goal in the study of AGN has been to understand 
the physical mechanism(s) responsible for triggering accre- 
tion onto the central supermassive black hole and enhanced 
activity in the nucleus of the galaxy. From a theoretical 
standpoint, N-body simulations that treat the hydrodynam- 
ics of the gas have shown that interactions between galax- 
ies can bring gas from the disk to the central regions of 
the galaxy, leading to enhanced star formation in the bulge 
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jBarnes fc Hernauisdll99a iMihos fc Hernauistm 996l. It is 
then natural to speculate that some of this gas will be ac- 
creted onto the central supermassive black hole and that this 
will trigger activity in the nucleus of the galaxy. However, 
there has been little clear observational evidence in support 
of this hypothesis. 

Many observational studies have examined the correla- 
tions between AGN activity in galaxies and their local en- 
vironment. These analyses have pro duced contradictory re- 
sults . Early studies (se e for example iPetrosia n 1982; Daharil 
11984 iKeel et alJll985h noted that powerful Seyfert galax- 
ies appear to show an excess of close companions rela- 
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tive to their non-active counterparts. More recent analy- 
ses of larger and more complete samples of Seyfert galax- 
ies have reached t he opposite conclusion (e.g. ISchmhvfeOOll; 
iMiller et all I2003T) . Studies of X-ray selected AGN at in- 
termediate redshifts also find no evidence for excess near- 
neig hbour counts or enhanced levels of g alaxy asymme- 
try (iGrogin et alJl2005l : IWaskett et alJl2005T) . On the other 
hand, a recent study of the local environment of a sample 
of ~ 2000 q uasars at z < 0.4 d rawn from the Sloan Digital 
Sky Survey iSerber et al] 120061) concluded that quasars do 
have a significant local excess of neighbours when compared 
to L* galaxies , but only on small scales (< 0.2 Mpc). These 
authors found the the excess to be significantly larger for 
the most luminous systems. This study suggests that the 
disagreement between different studies may reflect the fact 
that they targeted AGN with different intrinsic luminosities. 

There have also been many studies of the large-scale 
clustering of AGN. This is usually quantified using the two- 
point correlation function (2PCF). In the standard model 
for structure formation, the amplitude of the two-point cor- 
relation function on scales larger than a few Mpc provides 
a direct measure of the mass of the dark matter halos that 
host the AGN. The large redshift surveys assembled in re- 
cent years, in particular the two- degree Field QSO Redshift 
Survey (^QZ MCroom et al.ll200ljh and the Sloan Digital Sky 
Survey (SPSS: lYork et aljhoOcT) . have allowed the cluster- 
ing of quasars to be studied with unprecedented accuracy. 
The cross-correlation between QSOs in the 2QZ and galax- 
ies in the two-degree Fi eld Galaxy Redshi ft Survey (2dF - 
GRS: IColless et al.l200lf) . measured bv lCroom et~aTI ]2005l) . 
is found to be identical to the autocorrelation of L, galaxies 
(a mean bias bgc = 0.97 ± 0.05). Measurements of the two- 
point correlation functio n of narrow- l ine A GN in the SDSS 
have been carried out bv lWake et alJ ll2004h . The results are 
similar to those for quasars - the amplitude of the AGN 
autocorrelation function is consistent with the autocorrela- 
tion function of luminous galaxies on scales from 0.2 to 100 
h~ Mpc. Similar results are found f or X-ray selected AGN 
jGilli et alJ 120051 iMullis et al]l2005l) . On the other hand, 
radio-loud A GN appear to be significantly more clustered on 
large scales jMagliocchetti et aJjll99sL 12004 lOverzier et~ai] 
2003), demonstrating that they reside in massive dark mat- 
ter halos. This is consistent with the fact that in the local 
Universe, radio-loud AGN are located in signific antly more 
mass iv e host galaxies than optical AGN (e.g. IBest et alJ 
120051) . IConstantin fc Vogelevl I2OO6I) analyzed AGN in the 
SDSS Data Release 2 sample and find that LINERs are 
more strongly clustered than Seyfert galaxies. Once again 
this is consistent with the fact that LIN ERs are found in 
more massive host galaxi es that Seyferts llKauffmann et aD 
l2003l:lKewlev et alJ^OOrj . 

In this paper, we analyze the clustering properties of 
89,211 narrow-line AGN selected from the Data Release 4 
(DR4) of th e Sloan D igital Sk y Surv ey using the procedure 
described in [Kauffmann et ajj (J2003J). Our methodology for 
computing correlati on funct i ons in the SDSS has been de- 
scribed in detail in iLi et aD (l2006h . where the dependence 
of clustering on physical properties such as stellar mass, age 
of the stellar population, concentration and stellar surface 
mass density was studied. In this paper, we extend this anal- 
ysis to AGN. Our approach differs from previous studies in 
the following ways: 



(i) We compute AGN-galaxy cross-correlations from 
scales of a few tens of kpc to scales of ~10 Mpc. This al- 
lows us to study the detailed scale dependence of the AGN 
clustering amplitude. 

(ii) We study how the clustering depends on both the 
black hole mass (estimated using the central velocity dis- 
persion of the galaxy) and the accretion rate relative to the 
Eddington rate (estimated using L[OIII], where L[OIII] is 
the [OIII]A5007 line luminosity, and Mbb is the black hole 
mass estimated from the central stellar velocity dispersion 
of the host). 

(iii) We wish to isolate the effect of the accreting black 
hole, so the clustering is always compared to the results ob- 
tained for control samples of inactive galaxies that are very 
closely matched to our AGN sample. We match simultane- 
ously in redshift, mass, and structural properties and we 
test the effect of additionally matching the age of the stel- 
lar population as characterized by the 4000 A break index 
D„(4000). 

(iv) We have constructed mock cat alogues using the hig h 
resolution Millennium Run simulation dSpringel et all2 005"). 
The mock catalogues match the geometry and selection 
function of galaxies in the DR4 large-scale structure sam- 
ple and they also reproduce the luminosity and stellar mass 
function of SDSS galaxies, as well as the shape and ampli- 
tude of the correlation functions in different bins of lumi- 
nosity and stellar mass. We use these catalogues to explore 
in detail how AGN trace the underlying galaxy and halo 
populations. 

Our paper is organized as follows: In section 2, we de- 
scribe the AGN and control samples that were used in the 
analysis. In section 3, we describe how we construct mock 
catalogues from the Millennium Simulation and use these 
to correct for effects such as fibre collisions. Section 4 de- 
scribes our clustering estimator; section 5 describes the ob- 
servational results and in section 6, we describe how we use 
our mock catalogues to extract physical information from 
the data. Finally in section 7, we summarize our results and 
present our conclusions. 



2 DESCRIPTION OF SAMPLES 

2.1 The SDSS Spectroscopic Sample 

The data analyzed in this study are drawn from the Sloan 
Digital Sky Survey (SDSS). The survey goals are to obtain 
photometry of a quarter of the sky and spectra of nearly one 
million objects. Imaging is obtained in the u, q, r, i, z bands 
jFukugita et alJll996l ISmith et ai]l2002l llvezic et al] | 2004T) 
with a special purpose drift scan camera (iGunn et al.lll99^ 
moun ted on the SDSS 2.5 meter telescope iGunnet^rt] 
2006) at Apache Point Observatory. The imaging dat a are 
photometrically llHogg et alJ 1200 it Trucker et al J [20051) and 
astrometrically jPier et al] l2003f) calibrated, and used to 
select stars, galaxies, and quasars for follow-up fibre spec- 
troscopy. Spectroscopic fibres are assigned to objects on the 
sky using an efficient tiling algori thm designed to optimize 
completeness jBlanton et al.l2 003 ) . The de tails of the survey 
strategy can be found in lYork et al] i2000T) and an overview 
of the data pipelines and products is provid ed in the Early 
Data Release paper (IStoughton et al.ll2002l) . More details 
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on th e photmetric pipeline can be found in iLupton et alJ 
feOOll) . 

Our parent sample for this study is composed of 397,344 
objects which have been spectroscopically confirmed as 
galaxies a nd have data publicly available in the SDSS Data 
Release 4 jAdelman- McCarthy et al"ll20 06). These galaxies 
are part of the SDSS ' main' galaxy sampl e used for large 
scale structure studies JStrauss et alJ 12002) and have Pet- 
rosian r magnitudes in the range 14.5 < r < 17.77 after 
correction for f oreground galactic extinction using t he red- 
dening maps of lSchlegel. Finkbeiner. fc Davis! Jl998). Their 
redshift distribution extends from ~ 0.005 to 0.30, with a 
median z of 0.10. 

The SDSS spectra are obtained with two 320-fibre spec- 
trographs mounted on the SDSS 2.5-meter telescope. Fibers 
3 arcsec in diameter are manually plugged into custom- 
drilled aluminum plates mounted at the focal plane of the 
telescope. The spectra are exposed for 45 minutes or un- 
til a fiducial signal-to-noise (S/N) is reached. The median 
S/N per pixel for galaxies in the main sample is ~ 14. The 
spectra are processed by an automated pipeline, which flux 
and wavelength calibrates the data from 3800 to 9200 A. 
The instrumental resolution is R = \/S\ = 1850 - 2200 
(FWHM~ 2.4 A at 5000 A). 

2.2 The AGN and control samples 

We have performed a careful subtraction of the stel- 
lar absorption-line spectrum before measuring the neb- 
ular emission-lines. This is accomplished by fitting the 
emission-line-free regions of the spectrum with a model 
galaxy spectru m computed using the new population syn- 
thesis code of iBruzual fe Charlod <l2003t) . which incorpo- 
rates a high resolution (3 A FWHM) stellar library. A set 
of 39 model template spectra were used spanning a wide 
range in age and metallicity. After convolving the template 
spectra to the measured stellar velocity dispersion of an 
individual SDSS galaxy, the best fit to the galaxy spec- 
trum is constructed from a non-negative linear combina- 
tion of the template spectra. Further details are given in 
iTremonti et alJ i2004f) . Physical parameters such as stellar 
masses, metallicities, and star formation rates have been es- 
timated using the spectra and these are publically available 
at http://w ww.mpa-garchin g.mpg.de/SDSS / The reader is 
referre d to ITremonti et alJ j2004h and iBrinchmann et ail 
(2004) for more details. 

AGN are selected from the subset of galaxies with 
S/N > 3 in the four emission lines [OIII]A5 007, H/3, 
[NIIJA6583, Ha. Following lKauffmann et aiH2003ft . a galaxy 
is defined to be an AGN if 

log([OIII]/H/3) > 0.61/(log([NII]/Ha) - 0.05) + 1.3. (1) 

We divide all the AGN into three subsamples according 
to logarithmic stellar velocity dispersion log 10 cr* . It is also 
interesting to study how clustering depends on the strength 
of nuclea r activity in the galaxy. In order to adress this issue, 
we follow IHeckman et al.l (120041) and use the [O ill] emission 
line luminosity as an indicator of the rate at which matter 
is accreting onto the central supermassive black h ole, and 
we use the relation given in lTremaine et alJ 1120021) to esti- 
mate black hole masses from the stellar velocity dispersion 
measured within the fibre aperture. We then use the ratio 



L[0 iii]/M_bh as a measure of the accretion rate relative to 
the Eddington rate, to define subsamples of "powerful" and 
"weak" AGN. (Note that in the current analysis, L[OIII] is 
corrected for dust extinction.) The AGN in each log 10 cr* 
subsample are ordered by decreasing L[0 ill] /Mb h- The 
top 25% are defined as "powerful" and the bottom 25% are 
"weak" . For each AGN sample, we construct 20 control sam- 
ples of non-AGN by simultaneously matching four physical 
parameters: redshift, stellar mass, concentration and stellar 
velocity dispersion. We have also constructed control sam- 
ples where the 4000 A break strength is matched in addition 
to these parameters. The matching tolerances are Acz < 500 
km s~\ AlogM* < 0.1, Act* < 20 km s" 1 , AC < 0.1 and 
AD n (4000)< 0.05. 

We correct the [OIII] luminosities of the AGN in our 
sample for dust using the difference between the observed 
Ha/H/3 emission line flux ratios and the case-B recombina- 
tion value (2 .86). We assumed an attenuation law of the form 
t\ oc A~ 0,7 dCharlot fc Fallll200(J) This procedure has clear 
physical meaning in the "pure" Seyfert 2's and LINERs. In 
the case of the transition objects, the lines will arise both in 
the NLR and the surrounding HII regions, with a greater rel- 
ative AGN contribution to [OIII] than to the Balmer lines. 
Thus, a dust correction to [OIII] based on the ratio Ha/H/3 
should be regarded as at best approximate. 

2.3 Reference galaxy sample 

We use the New York University Value Added Galaxy Cat- 
alogue (NYU-VAGC) 1 to construct a reference sample of 
galaxies, which are cross-correlated with the AGN sample. 
The original NYU-VAGC is a catalo gue of local galaxies 
(mostly below z ta 0.3) constructed bv lBlanton et all (2005) 
based on the SDSS DR2. Here we use a new version of the 
NYU-VAGC (Sample dr4), which is based on SDSS DR4 
The N YU-VAGC is described in detail in iBlanton etafl 
J2005I) . 

We have constructed two reference samples: 1) a spec- 
troscopic reference sample , which is used to compute the 
projected AGN-galaxy cross-correlation function w(r p ); 2) 
a photometric reference sample, which is used to calculate 
counts of close neighbours around AGN 

The spectroscopic reference sample is constructed by 
selecting from Sample dr4 all galaxies with 14.5 < r < 17.6 
that are identified as galaxies from the Main sample (note 
that r-band magnitude has been corrected for foreground 
extinction). The galaxies are also restricted to the redshift 
range 0.01 ^ z ^ 0.3, and the absolute magnitude range 
— 23 < Mo.i T < —17. The spectroscopic reference sam- 
ple contains 292,782 galaxies. We do not consider galaxies 
fainter than Mo.i r = —17, because the volume covered by 
such faint samples is very small and the results are subject 
to large erro rs as a result o f cosmic variance (see for exam- 
ple Fig. 6 of lLi et al.ll2006Tl . The faint apparent magnitude 
limit of 17.6 is chosen to yield uniform galaxy sample that 
is complete over the entire area of the survey. 

The photometric reference sample is also constructed 
from Sample dr4 by selecting all galaxies with 14.5 < 
r < 19. The resulting sample includes 1,065,183 galaxies. 

1 http:/ /wassup. physics. nyu.edu/vagc/ 
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Throughout this work we adopt standard ACDM cosmolog- 
ical parameters: Q = 0.3, Qa = 0.7, and Ho = 70 km s _1 
Mpc" 1 . 



3 MOCK CATALOGUES 

In this section we describe how we construct a large set of 
mock galaxy samples with the same geometry and selection 
function as the spectroscopic samples described in the pre- 
vious section . We will use these mock catalogues to test our 
method for correcting for the effect of fibre collisions on the 
measurement of the AGN-galaxy cross-correlation function 
on small scales. We will also use these mock catalogues to 
construct models of AGN clustering for comparison with the 
observations. 

3.1 Galaxy properties in the Millennium 
Simulation 

Our mock c atalogues are constru cted using the Millennium 
Simulation iSpringel et al.ll2005l) . a very large simulation 
of the concordance ACDM cosmogony with 10 10 particles. 
The chosen simulation volume is a periodic box of size 
Lbox = 500/i." 1 Mpc on a side, which implies a particle mass 
of 8.6 x 10 s 4/i _1 M©. Haloes and subhaloes at all output 
snapshots are identified using the subf ind algorithm de- 
scribed in lSpringel et all i200 ill and merger trees are then 
construc ted that desc r ibe ho w haloes grow as the Universe 
evolves. ICroton et alJ i2006T) implemented a model of the 
baryonic physics in these simulations in order to simulate 
the formation and evolutio n of galaxies and the ir central su- 
permassive black holes fsee lCroton et ai]l2006t for more de- 
tails) . This model produced a catalogue of 9 million galaxies 
at z — down to a limiting absolute magnitude limit of M r — 
5 log ft = —16.6. This catalogue is well-matched to many 
properties of the present-day galaxy population (luminosity- 
colour distributions, clustering etc.). It is publically available 
at http://www.mpa-garching.mpg.de/galform/agnpaper In 
our work, we adopt the positions and velocities of the galax- 
ies given in the Croton et al. catalogue. The r-band lumi- 
nosities and stellar masses are assigned to each model galaxy, 
using the parametrized functions described bv lWang et alJ 
(2006). These functions relate the physical properties of 
galaxies to the quantity Mmfall, defined as the mass of the 
halo at the epoch when the galaxy was last the central dom- 
inant object in its own halo. They were chosen so as to 
give clos e fits to the re s ults o f the physical galaxy formation 
model of lCroton et al] (j2006), but their coefficients were ad- 
justed to improve the fit to the SDSS data, in particular the 
galaxy mass function at the low mass end. Extensive tests 
have shown that the adopted parametrized relations allow 
us to accurately match the luminosity and stellar mass func- 
tions of galaxies in the SDSS, as well as the shape and am- 
plitude of the two point correlation function of galaxies in 
diffe rent luminosi ty and stellar mass ranges jli et aljl200H 
IWang et al]l2006ft . 

3.2 Constructing the catalogues 

Our aim is to construct mock galaxy redshift surveys that 
have the same geometry and selection function as the SDSS 



DR4. A detailed account of the observational selection ef- 
fects accompanies the NYU-VAGC release. The survey ge- 
ometry is expressed as a set of disjoint convex spherical 
polygons, defined by a set of "caps". This methodology was 
developed by Andrew Hamilton to deal accurately and ef- 
ficiently with the complex a ngular masks of galaxy surveys 
jHamilton fc Tegmarkll2002l) 2 . The advantage of using this 
method is that it is easy to deter mine whether a point is in- 
side or outside a given polygon (iTegmark et aljl2002l) . The 
redshift sampling completeness is then defined as the num- 
ber of galaxies with redshifts divided by the total number 
of spectroscopic targets in the polygon. The completeness is 
thus a dimensionless number between and 1, and it is con- 
stant within each of the polygons. The limiting magnitude 
in each polygon is also provided (it changes slightly across 
the survey region). 

We con struct our mock c atalogues using the methods 
described in lYang et alJ i2004f) . except that we position the 
virtual observer randomly inside the simulation and not at 
the centre of the box. Because the survey extends out to 
z ~ 0.3, this implies that we need to cover a volume that 
extends to a depth of 900/i _1 Mpc, i.e. twice that of the 
Millennium catalogue. We thus create 5x5x5 periodic 
replications of the simulation box and place the observer 
randomly within the central box, so that the required depth 
can be achieved in all directions for the observer. 

We produce 20 mock catalogues by following the proce- 
dure described below: 

(i) We randomly place a virtual observer in the stack of 
boxes described above. We define a (a,5)-coordinate frame 
and remove all galaxies that lie outside the survey region. 

(ii) For each galaxy we compute the redshift as "seen" 
by the virtual observer. The redshift is determined by the 
comoving distance and the peculiar velocity of the galaxy. 

(iii) We compute the r-band apparent magnitude of each 
galaxy from its absolute magnitude M r and its redshift, ap- 
plying a (negative) K-correction but neglecting any evolu- 
tionary correction. We then select galaxies according to the 
position-dependent magnitude limit (provided in the Sample 
dr4) and apply a (positive) K-correction to compute Mo.i r , 
the r-band absolute magnitude of the galaxy at z — 0.1. 

(iv) To mimic the position-dependent completeness, we 
randomly eliminate galaxies using the completeness masks 
provided in Sample dr4. 

(v) Finally, we mimic the actual selection criteria of our 
own reference sample by restricting galaxies in the mock 
catalogue to 0.01 < z < 0.3, 14.5 < r < 17.6 and -23 < 
Mo.i r < -17. 

Figure shows the equatorial distribution of galaxies in 
one of our mock catalogues, compared to that in the observa- 
tional sample. The average number of galaxies in our mock 
catalogues is ~320,000, with a r.m.s. dispersion of ~ 9000, 
in good agreement with the observed number. 

3.3 Fibre collisions 

The procedure described above does not account for the fact 
that the spectroscopic target selection becomes increasingly 

2 http:/ /casa. colorado.edu/~ajsh/mangle/ 
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Figure 1. Equatorial distribution of right ascension and redshift for galaxies within 6° of the equator in the SDSS (left) and in one of 
our mock catalogues (right). 



incomplete in regions of the sky where the galaxy density 
is high, because two fibres cannot be positioned closer than 
55 arcseconds from each other. In order to mimic these fibre 
"collisions", we modify step (4) above. We no longer ran- 
domly sample galaxies using the completeness masks. In- 
stead, we assign fibres to our mock galaxies using a proce- 
dure that is designed to mimic the tiling co de that a ssigns 
spect roscopic fibres to SDSS target galaxies (IBlanton et alJ 
2003). We run a friends-of- friends grouping algorithm on the 
mock galaxies with a 55" linking length. Isolated galaxies, 
i.e. those in single-member groups, are are always assigned 
fibres. If two targets collide, we simply pick one at random 
to be the spectroscopically targeted galaxy. For groups with 
more than two members, we need to determine which mem- 
bers will be assigned fibres and which will not. The resulting 
groups are almost always of sufficiently low multiplicity so 
that in principle, one could simply check all possibilities to 
find the best possible combination of galaxies that would 
eliminate fibre collisions (this is the procedure adopted in 
the SDSS target selection). In our work we use a somewhat 
more efficient method. For each group, we calculate the ge- 
ometric centre. The group members located closer to the 
center are preferentially eliminated. For example, in a triple 
collision this algorithm will keep the outer two members 
rather than the middle one. 

This procedure is complicated by the fact that some 
fraction of the sky will be covered with overlaps of differ- 
ent tiles (Each spectroscopic fibre plug plate is referred as 
a "tile", which has a circular field of view with a radius of 
1°49.). About 30 % of the sky is covered in such overlaps. 
This means that if, for example, a binary group covered 
by two ore more tiles, both of the group members can be 
assigned fibres. We take this into account by iteratively re- 
peating the procedure described above, as follows. 

(i) First, we determine the number of tiles that cover a 
given galaxy and set the quantity n c fc ances equal to this num- 
ber. For example, n c h ances = 2 for a galaxy covered by two 
tiles; this galaxy has two chances to be assigned a fibre. 

(ii) We assign fibres by applying the algorithm described 
above to those galaxies with nuances > 0. If a galaxy obtains 



a fibre in this procedure, then the quantity nuances is set 
to zero. If not, we set n c h ances = n c hances — 1. 

(iii) Step (ii) is repeated until n c h ances reaches zero for all 
galaxies. All galaxies that are not assigned fibres are then 
removed from our mock catalogue. 

(iv) Finally, we remove a number of galaxies at random 
so that the resulting mock sample has the same overall 
position-dependent completeness as the real SDSS sample. 



4 CLUSTERING MEASURES 

In order to compute the two-point cross-correlation func- 
tion (2PCF) £(r p ,7r) between the AGN host (or matched 
control) sample and the reference galaxy sample , we have 
constructed random samples that are designed to include 
all ob servational sele ction effects. This is described in de- 
tail in lLi et alJ <2006ft . £(r p ,7r) is then calculated using the 
estimator 



£0>,7r) 



N R QD(r p ,Tv) 
No QR(r p ,n) 



1. 



(2) 



where r p and 7t are the separations perpendicular and par- 
allel to the line of sight; Nd and Nr are the number of 
galaxies in the reference sample and in the random sample; 
QD(r p ,n) and QR(r p ,n) are the cross pair counts between 
AGN (or control) and the reference sample, and between 
AGN (or control) and the random sample, respectively. 

In what follows, we focus on the projection of £(r p ,7r) 
along the line of sight: 



w p (r p ) 



£(r p , tt)c1-k = >J f (r p , %i)Awi 



(3) 



Here the summation for computing w p (r p ) runs from 7Ti = 
-39.5 h" 1 Mpc to 7T8o = 39.5 h" 1 Mpc, with Att, = 1 h" 1 
Mpc. 

The errors on the clustering measurements are 
estimated using the bootstrap resampling technique 
jBarrow. Bhavsar. fc Sonodalll984) . We generate 100 boot- 
strap samples from the observations and compute the 
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correlation functions for each sample using the weight- 
ing scheme (but not the a pproximate formula) given by 
IMo. Jing. fc Boernerl il992T) . The errors are then given by 
the scatter of the measurements among these bootstrap sam- 
ples. More details about our procedu r es and tests of our 
methodology can be found in iLi et al.l <2006ft . We also ob- 
tain robust estimates of our uncertainties from the scatter 
between the results obtained from disjoint areas of the sky. 

We are particularly interested in the amplitude of the 
AGN-reference galaxy cross-correlation on small scales (< 
100 kpc), because this allows us to evaluate whether merg- 
ers and interactions play a role in triggering AGN activity. A 
careful correction for the effect of fibre collisions when mea- 
suring the clus tering is thus very important. As described in 
ILi et alJ j2006ft . we correct for fibre collisions by comparing 
the angular 2PCF of the spectroscopic sample with that of 
the parent photometric sample. Here we use our mock SDSS 
catalogues to test the correction method. We calculate the 
angular correlation functions w z (d) and w p (6) using mock 
catalogues with and without fibre collisions. The function 



F{8) = 



l + w p {9) 



(4) 



is then used to correct for collisions by weighting each pair 
by 1/F(9). Figure |21 (the top panel) shows measurements of 
w p (r p ) for galaxies in one of our mock catalogues. The solid 
line is the "true" correlation function calculated for the mock 
catalogues that do not include fibre collisions. Circles show 
the results when fibre collisions are included. Triangles show 
the results that are obtained when the 2PCF is corrected 
for the effect of fibre collisions using the method described 
above. In the bottom panel, we plot the ratios of the un- 
corrected and the corrected w p (r p ) relative to the "true" 
correlation function. As can be seen, our correction proce- 
dure works well. The turnover in the amplitude on small 
physical scales resulting from the lower sampling of galaxies 
in dense regions disappears and the corrected w p (r p ) is very 
close to the real one. It is noticeable that there is still a very 
small deficit in the corrected w p (r p ) on scales between 0.05 
and 1 Mpc. This should not be a significant contribution 
to the bias between AGN and normal galaxies (see below), 
because fibre collisions are expected to affect the AGN and 
the reference galaxies in the same way. 



5 OBSERVATIONAL RESULTS 
5.1 AGN Bias 

We first compute w p GN ^ rcf (r p ), the cross-correlation of the 
AGN sample with respect to the reference sample. As de- 
scribed in section 2, we have constructed two sets of 20 
control samples. The first set is constructed by simulta- 
neously matching redshift, stellar mass, concentration and 
stellar velocity dispersion, and the second set by addition- 
ally matching the 4000A break strength. We then compute 
w p ontI ^ Tct (r p ), the average cross-correlation of the control 
samples with respect to the reference sample. The quan- 
tity w p GN ^ rcf (r p )/w p ° ntI ^ ct (r p ) then measures the bias of 
the AGN sample with respect to the control sample of non- 
AGN as a function of projected radius r p . 

The results are shown in Fig. 3. In the top panel, we 

1 j- AGN/ref/ \ . i -contr/rcf/ \ . l j. j r 

plot w p (r p ) as circles. w p (r p ) is evaluated tor 
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Figure 2. Top: Projected 2PCF w p (r p ) measured for the mock 
catalogue without including fibre collisions (solid line) and for the 
mock catalogue with fibre collisions (circles). The filled triangles 
show the measured w p (r p ) for the mock after correcting the effect 
of fibre collisions using the method described in the text. Bottom: 
The ratios of the uncorrected and the corrected w p (r p ) relative 
to the "true" Wp(r p ). 



the two sets of control samples and the results are plotted as 
squares for the first set and triangles for the second set. The 
measurement errors are estimated using the bootstrap re- 
sampling technique described in the previous section. In the 

-contr/ref / 



bottom panel, we plot the ratio w p GN ^ ret (r p )/w p ° 



'(r P ) 



for the two control samples, The errors are estimated in the 
same manner as in the top panel. For clarity, squares and 
triangles in both panels have been slightly shifted along the 
r p -axis. 

Figure 3 shows that there exists a scale-dependent 
bias in the distribution of AGN relative to that of nor- 
mal galaxies. In particular, the ratio between the two cross- 
correlations appears to exhibit a pronounced "dip" at scales 
between 100 kpc and 1 Mpc. We note that errorbars esti- 
mated using the bootstrap resampling technique do not take 
into account effects due to cosmic variance. The coherence 
length of the large scale structure is large and even in a 
survey as big as the SDSS, this can induce significant fluc- 
tuations in the amplitude of the correlation function from 
one part of the sky to another. The difference in the clus- 
tering amplitude of AGN and non-AGN shown in Figure 3 
is only a 10-30% effect , so it is important to test whether 
the dip seen in Figure 3 is truly robust. 

We have thus divided the survey into 6 different areas 
on the sky. Each subsample includes ~ 12, 000 AGN. We 
recompute the AGN bias for each of these subsamples and 
the results are shown in Fig. 4. Note that in this plot, we 
only use a single control sample to compute the bias, not 
the average of 20 control samples as in Figure 3. The scat- 
ter between the different curves in Figure 4 thus provides an 
upper limit to the true error in the measurement of the bias. 
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1.4 



Figure 3. Top: projected cross-correlation function 
Wp GN ^ Tci (r p ) between AGN and reference galaxies is plot- 
ted as circles. The average cross-correlation between two sets of 
20 control samples of non-AGN and the same reference galaxies 
is plotted as squares and triangles (squares: redshift, stellar 
mass, stellar velocity dispersion and concentration are matched; 
triangles: the control samples are constructed by additionally 
matching the 4000A-break strength). The inset compares the 
D4000 distribution for the two sets of control samples, with red 
for the former set and blue for the latter. The histogram shows 
the D n (4000) distribution for the AGN. Bottom: ratio of the 
w p (r p ) measurement of AGN to that of non-AGN. Symbols are 
the same as in the top panel. 
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Figure 4. Ratio of the w p (r p ) measurement of AGN to that of 
non-AGN for six disjoint regions of the survey. 
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Figure 7. Ratio of the w p (r p ) measurement of AGN to that of 
non-AGN for 20 different non-AGN control samples. The yellow 
shaded region shows the ratio of the w p (r p ) measurements of the 
different non-AGN control samples relative to each other. 



As can be seen, on small scales (< 50 kpc) , the different 
subsamples scatter in bias above and below unity. On scales 
between 0.1 and 1 Mpc, however, all 6 subsamples lie sys- 
tematically below this line. On scales larger than 1-2 Mpc, 
five out of six subsamples show bias values below unity, but 
the effect is clearly less significant than on scales between 
0.1 and 1 Mpc. In Fig. 5, we examine the dispersion in the 
bias measurement for the AGN sample as a whole caused 
by differences between the control samples. As can be seen, 
the scatter in the measurement of the bias between differ- 
ent control samples is considerably smaller than the scatter 
between different survey regions, showing that that cosmic 
variance is, in fact, the dominant source of uncertainty in 
our results. Once again, there is clear indication that AGN 
are antibiased relative to the control galaxies on scales larger 
than 100 kpc. 

We conclude that on scales between 0.1 and 1 Mpc, 
AGN are significantly anti-biased relative to non-AGN of the 
same stellar mass, concentration and stellar velocity disper- 
sion. Figure 3 shows that this anti-bias persists even when 
the mean age of the stellar population is matched in ad- 
dition to stellar mass and structural parameters. We note 
that these scales are comparable to the diameters of the 
dark matter halos that are expected to host galaxies with 
stellar masses comparable to the objects in our sample. In 
section 6, we construct halo occupation (HOD) models using 



mock catalogues constructed from the Millennium Simula- 
tion that can explain the anti-bias on these scales. As we 
will show, the same models naturally predict a smaller, but 
significant antibias on scales larger than 1 Mpc. 

5.2 Dependence on black hole mass and AGN 
power 

It is interesting to study how AGN clustering depends on 
black hole mass and the strength of nuclear activity in the 
galaxy. As described above, we use the stellar velocity dis- 
persion as an indicator of the black hole mass and divide all 
AGN into three subsamples according to log 10 a*. We then 
use the ratio L[0 iii]/M_bh as a measure of the accretion 
rate relative to the Eddington rate. We rank order all the 
AGN in a given interval of stellar velocity dispersion accord- 
ing to L[0 iii]/Msh and we define subsamples of "powerful" 
and "weak" AGN as those contained within the upper and 
lower 25th percentiles of the distribution of this quantity. 

The results are shown in Figure 6. Panels from left to 
right correspond to different intervals of log 10 <r* , as indi- 
cated at the top of the figure. The first two rows show the 
w p (r p ) measurements for the AGN and the corresponding 
control samples. The third row shows the ratio between the 
two. Red (blue) lines correspond to the powerful (weak) sub- 
samples. Black lines show results for the sample as a whole. 



© 2006 RAS, MNRAS OOO.mm 



8 Li, Kauffmann, Wang, White & Heckman 




Figure 5. Top: projected cross-correlation w p (r p ) in different cr, bins (indicated above each panel), for all AGN (black), powerful (red) 
and weak (blue) AGN. The powerful (weak) AGN are denned as the top (bottom) 25 per cent objects ordered by decreasing L[0 in]/M. . 
The middle row is for control samples of non-AGN and the bottom row shows the ratio between the results for the AGN and the control 
samples. The insets in the bottom panels compare results for all AGN using different control samples. Black is for control samples 
constructed by matching rcdshift, stellar mass, stellar velocity dispersion and concentration, while red is for control samples where the 
4000Abreak strength is also matched. 
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Figure 6. w p (r p ) in different L[L m]/M. bins (indicated above each panel), for all AGN (solid) and for control samples of non-AGN 
(dashed). The small panels give the ratio between the above two (black). The red lines are results where the 4000A-break strength is 
also matched when constructing control samples. 
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As mentioned previously, there are two sets of control 
samples: sample 1 is constructed by simultaneously match- 
ing redshift, stellar mass, concentration and stellar velocity 
dispersion; sample 2 is constructed by additionally matching 
the 4000A break strength. For clarity, the main panels in in 
Figure 6 show the results only for sample 1. The insets in the 

i , , , . AGN/rcf / \ / -contr/rcf / \ 

bottom row compare the ratio w p \ r v)/ w p \ r v) 

for the two control samples (black corresponds to set 1 and 
red to set 2). The two different control samples give very 
similar results on scales less than a few Mpc, but the large- 
scale antibias is more pronounced for sample 2 (note that 
this is also seen in Figure 2). 

As can be seen, the "dip" in clustering on scales between 
0.1 and 1 Mpc is most pronounced for AGN with the largest 
central stellar velocity dispersions and the highest accretion 
rates. On scales smaller than 0.1 Mpc, more powerful AGN 
appear be somewhat more strongly clustered than weaker 
AGN and more strongly clustered than galaxies in the con- 
trol sample. The error bars on the measurements are large, 
however, and effect is not of high significance. In Figure 7 we 
plot results for AGN of all velocity dispersions, but now in 
four different intervals of L[0 iii]/M_bh, as indicated at the 
top of the figure. Once again we see a marginal tendency 
for AGN with higher values of L[0 iii]/M_bh to be more 
strongly clustered on small scales. 

5.3 Close Neighbour Counts 

As we have discussed, one of the problems with computing 
the galaxy-AGN cross- correlation function on very small 
physical scales in the SDSS is that corrections for the effect 
of fibre collisions are required. These corrections are statis- 
tical in nature and even if they are correct on average, they 
may still introduce systematic effects in our analysis. An 
alternative approach is to count the number of galaxies in 
the vicinity of AGN in the photometric sample, which is not 
affected by incompleteness. The disadvantage of using the 
photometic sample is that many of the close neighbours will 
not be truly nearby systems, but rather chance projections 
of foreground and background galaxies that lie along the 
line-of-sight. We can make a statistical correction for this 
by evaluating the counts around randomly placed "galax- 
ies" with the same assumed joint distribution of apparent 
magnitude and redshift as the AGN (control) samples. 

In the upper panel of Figure 8 we plot the average cor- 
related neighbour count (i.e. after statistical correction for 
uncorrelated projected neighbours) within a given value of 
the projected radius R p for the AGN sample (red) and the 
control samples (blue). The lower panel and its insets show 
the difference betweeen the average correlated counts for the 
AGN and control samples as a function of R p . The variance 
in the counts around the control galaxies estimated from the 
20 different control samples is shown in yellow. The AGN 
sample has a r-band limiting magnitude of 17.6 and the pho- 
tomteric reference sample that we use is limited at r=19.0. 
In order to ensure that we are counting similar neighbours 
at all redshifts, the counts only include those galaxies with 
t < tagn + 1-4 mag. In this analysis, the control sample is 
matched in r-band apparent magnitude as well as redshift, 
stellar mass, velocity dispersion and concentration. This en- 
sures that we are counting galaxies to the same limiting 
magnitude around both the AGN and the control galaxies. 



Figure 8 shows that the counts around the AGN and 
the control galaxies match well on large scales. On small 
scales, there is a small but significant excess in the number 
of neighbours around AGN out to scales of ~ 70 kpc. As 
may be seen from the bottom panel of Figure 8, AGN are 
appromimately twice as likely to have a near neighbour as 
galaxies in the control sample. This does not mean, however, 
that every AGN has a close companion. Figure 8 also shows 
that only one in a hundred AGN has an additional close 
(R p < 70 kpc) neighbour as compared to the control galax- 
ies. On scales larger than 100 kpc, the pair counts around 
the AGN dip below the counts around the control samples, 
leading to the "anti-bias" discussed in the previous section. 
This may be compensated on scales larger than several Mpc, 
although such compensation is not required with our present 
statistics. 

Figure 9 shows the counts around AGN in four dif- 
ferent ranges of L[OIII]/ Afe.fr. As can be seen, the excess 
on small scales increases as a function of the accretion rate 
onto the black hole. However, the excess affects only a few 
percent of the AGN, even for t he objects in our h ighest 
L[Olll]/M B H bin. We note that ISerber et al.l (§006) ana- 
lyzed galaxy counts around quasars compared to L* galaxies 
at the same redshift and found a clear excess on scales less 
than 100 kpc, very similar to the ~ 70 kpc scales where we 
see the upturn in the counts around our sample of narrow- 
line AGN. Serber et al also found that the excess was largest 
for the most luminous quasars; the excess count reached val- 
ues ~ 1 (i.e. significantly larger than the excess found for the 
most powerful narrow-line AGN in our sample) for quasars 
with i-band magnitudes brighter than —24. If we use the re- 
lation betwee n [Olllj line luminosity and quasar continuum 
luminosity of IZakamska et ail i2003l) to compare the AGN 
in our sample to the quasars studied by Serber et al, we find 
that the luminosities where quasars begin to exhibit a sig- 
nificant excess count lie just beyond those of the AGN that 
populate our highest L[OIII]/Ms_ff bin. 

Our conclusion, therefore, is that we do not find strong 
evidence that interactions and mergers are playing a signif- 
icant role in triggering the activity in typical AGN in the 
local Universe. One caveat that should be mentioned is that 
if the activity is triggered after the merger has already taken 
place, our pair count statistics would not be a good diag- 
nostic. In order to assess this, more work is needed to assess 
whether AGN exhibit any evidence for disturbed morpholo- 
gies or stuctural peculiarities. 



6 INTERPRETION OF AGN CLUSTERING 
USING HALO OCCUPATION MODELS 

In the previous section we showed that the main difference 
in the AGN-galaxy cross-correlation function with respect 
to that of a closely matched control sample of non-AGN is 
that AGN are more weakly clustered on scales between 100 
kpc and 1 Mpc. On larger scales, there is a much smaller 
difference in the clustering signal of AGN and non-AGN. 
The clustering amplitude of AGN on large scales provides 
a measure of the mass of the dark matter halos that host 
these objects. The fact that there is only a small difference 
between the AGN and the control sample tells us that AGN 
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Figure 8. Top: Average counts of galaxies in the photometric sample (rn m < 19) within a given projected radius R p from the AGN 
(red) and from the control galaxies (blue). Bottom: The difference between the counts around the AGN and the control galaxies is plotted 
as a function of R p . The yellow bands indicate the variance in the results between the 20 different control samples. 
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Figure 9. Same as the top panel of Fig. [5] but for four subsamples of AGN with different L[0 m]/M, , as indicated in each panel. 
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Figure 10. Top: The AGN/reference cross-correlation func- 
tion calculated from the mock catalogues is plotted as solid 
curves, while the control/reference cross-correlations are plotted 
as dashed curves. The different colours indicate models in which a 
given percentage (as indicated on the plot) of the AGN are located 
at the centers of their own dark matter halos. Bottom: The ra- 
tio between the AGN / reference galaxy cross-correlation functions 
and the control galaxy/ reference galaxy correlation functions are 
plotted for the same set of models. 



are found in roughly similar dark matter halos to non-AGN 
with the same stellar masses and structural properties. 

The physical scales of 0.1-1 Mpc where we do see strong 
differences in the clustering of AGN and non-AGN are sim- 
ilar to the virial diameters of the dark matter halos that 
are expected to host ga laxies with luminosities of ~ L, 
jMandelbaum et al.l (2006) . The simplest interpretation of 
the AGN anti-bias on these scales, therefore, is that AGN 
occupy preferred positions within their dark matter halos 
where conditions are more favourable for continued fuelling 
of the central black hole. One obvious preferred location 
would be the halo centre where gas is expected to be able 
to reach high enough overdensities to cool via radiative pro- 
cesses. In the main body of the more massive dark mat- 
ter halos, most of the surrounding gas will have been shock 
heated to the virial temperature of the halo and will no 
longer be able to cool efficiently. In addition, the vast ma- 
jority of galaxy-galaxy mergers within a halo will occur with 
the g alaxy that is located at the halo centre llSpringel et alJ 

In this section, we use our mock catalogues to test 
whether a model in which AGN are preferentially located 
at the centres of dark matter halos can fit our observa- 
tional results. As mentioned in section 2 , we h ave used 
the methodology introduced bv lWang et alJ i2006f) to assign 
stellar masses to the galaxies in the catalogues. These au- 
thors adopted parametrized functions to relate galaxy prop- 
erties such as stellar mass to the quantity Minfatt, defined 
as the mass of the halo at the epoch where the galaxy was 
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Figure 11. Top: The AGN/reference cross-correlation function 
calculated from the SDSS is plotted as solid circles. The open 
circles show the control galaxy /reference galaxy cross-correlation 
function from the SDSS. Results for AGN and control galaxies for 
our best-fit model are plotted as solid and dashed red lines. Bot- 
tom: Solid circles show the ratio w AGN ^ cl (r p )/w contIol ^ ct {r p ) 
for the SDSS sample. The solid red curve shows the result for our 
best-fit models. The error bars indicate the uncertainty due to 
cosmic variance as estimated from 20 different mock catalogues 
(see text for more details). The dashed red curves indicate the 
variance between different control samples from different mock 
catalogues (see text). 



last the central dominant object in its own halo. It was 
demonstrated that these parametrized relations were able 
to provide an excellent fit to the basic statistical properties 
of galaxies in the SDSS, including the stellar mass function 
and the shape and amplitude of the two-point correlation 
function function evaluated in different stellar mass ranges. 
We now introduce a simple model in which pagn , the prob- 
ability of a galaxy to be an AGN depends only on whether 
it is the central galaxy of its own halo. 

In order to create mock AGN and control catalogues 
that we can compare directly with the observational data, we 
follow the following procedure. For every AGN in our sam- 
ple, we select galaxies from the mock catalogue that have the 
same stellar mass and the same redshift. We then choose an 
AGN from among these galaxies based on whether they are 
central or satellite systems. The control galaxies are selected 
at random from the same set. The AGN and control sam- 
ples are then cross-correlated with a reference sample that 
is drawn from the mock catalogue in exactly the same way 
as our real SDSS reference sample. The top panel in Figure 
10 shows how the AGN/reference galaxy cross-correlation 
function changes as a function of the fraction of AGN that 
are central galaxies. Note that if the probability of being an 
AGN is independent of whether the galaxy is a central or 
a satellite system, 73% of the AGN will be central galax- 
ies. The bottom panel of Figure 10 shows how the ratio 
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between the AGN and control galaxy cross-correlation func- 
tions varies as this fraction changes. 

As the fraction of centrally-located AGN increases, the 
"dip" on scales smaller than 1 Mpc becomes more and more 
pronounced. There is also a small decrease of the ratio 
lu AGN / ref (r p )/w control/ref (r p ) on large scales. The latter ef- 
fect arises because, as more and more AGN are required to 
be central galaxies in their own halos, they also shift into 
lower mass halos. High mass halos are less abundant than 
low mass halos and by definition, each halo can only contain 
one central galaxy. As the central galaxy criterion on AGN 
becomes more stringent, fewer AGN will reside in halos of 
10 14 - 10 15 M Q and more in halos with 10 12 - 1O 13 M . The 
biggest effect, however, the dip on scales less than 1 Mpc, 
results from the fact that central galaxies in lower mass ha- 
los have fewer neighbours than non-central galaxies within 
rich groups. 

In Figure 11, we compare the results of our simple model 
with the observational data. In the top panel, the solid and 
open circles show the cross-correlation functions of the AGN 
and the control samples respectively. The solid and dashed 
lines show our best-fit models, in which 84% of all AGN are 
located at the centres of their own dark matter halos. In 
the bottom panel we compare the ratio of w(r p ) for the ob- 
served AGN and control galaxies with that obtained for the 
model. The error bars plotted on the model curve provide 
an estimate of the uncertainty in the result due to cosmic 
variance effects. In order to estimate these errors, we have 
created 20 different mock catalogues by repositioning the 
virtual observer at random within the simulation volume. 
For each mock catalogue, we repeat our computations of 
the AGN and control galaxy cross-correlation functions. The 
error bars are then calculated by looking at the variance in 
the ratio w AGN/rc{ (r p )/w contIol/lct (r p ) for AGN and control 
galaxies selected from different catalogues. The errors esti- 
mated in this way are similar in size to the errors estimated 
by calculating the variance in iu COTltrol/ ' ref (r p ) from different 
mock catalogues; these errors are indicated by the dashed 
red lines in Figure 11. As can be seen, the model provides a 
good fit to the observations from scales of ~ 30 kpc out to 
scales beyond 10 Mpc. On scales smaller than 30 kpc, the 
AGN show a small, but significant excess in clustering with 
respect to the model. This is nicely in line with the results 
presented in the previous section. 



7 SUMMARY AND CONCLUSIONS 

In this paper, we have analyzed the clustering of Type 2 
narrow-line AGN in the local Universe using data from 
the Sloan Digital Sky Survey. The two physical questions 
we wish to address are, a) How do the locations of galax- 
ies within the large-scale distribution of dark matter influ- 
ence ongoing accretion onto their central black holes? b)Is 
AGN activity triggered by interactions and mergers between 
galaxies? To answer these questions, we analyze the scale- 
dependence of the AGN/galaxy cross-correlation function 
relative to control samples of non-AGN that are closely 
matched in stellar mass, redshift, structural properties, and 
mean stellar age as measured by the 4000 A break strength. 
This close matching is important because previous work has 
established that the clustering of galaxies depends strongly 



on properties such as luminosity, stellar mass, colour, spec- 
tral type, mean st ellar age, concentrat i on and stellar sur- 
face mass density jNorberg et alJ 120021: IZehavi et afll200l 
l2005t iLi et alJ l200o) . Previous work has also established 
that AGN are not a random subsample of the the under- 
lying galaxy population. Rather, they are found in massive, 
bulge-dominated galaxies; powerful AGN tend to occur in 
galaxies with smaller black holes a nd younger-than-average 
stellar populations fo r their mass (iKauffmann et aDl2003l : 
iHeckman et al.ll2004h . If we wish to understand whether 
there is a real physical connection between the location of a 
galaxy and the accretion state of its central black hole, it is 
important that we normalize out these zero'th order trends 
with galaxy mass, structure and mean stellar age. 

When we compare the clustering of AGN relative to 
carefully matched control samples, and we take the errors 
due to cosmic variance into account, we obtain the following 
results: 

(i) On scales larger than a few Mpc, the clustering am- 
plitude of AGN hosts does not differ significantly from that 
of similar but inactive galaxies. 

(ii) On scales between 100 kpc and 1 Mpc, AGN hosts 
are clustered more weakly than control samples of similar 
but inactive galaxies. 

(iii) On scales less than 70 kpc, AGN cluster more 
strongly than inactive galaxies, but the effect is weak. The 
excess number of close companions is only one per hundred 
AGN. 

Our clustering results on large scales demonstrate that 
the host galaxies of AGN are found in similar dark matter 
halos to inactive galaxies with the same structural proper- 
ties and stellar masses. We have used mock catalogues con- 
structed from high-resolution N-body simulations to show 
that the AGN anti-bias on scales between 0.1 and 1 Mpc 
can be explained by AGN residing preferentially at the cen- 
tres of their dark matter halos. Our result on small scales 
indicates that although interactions may be responsible for 
triggering AGN activity in a minority of galaxies, an alter- 
native mechanism is required to explain the nuclear activity 
in the majority of these systems. 

As we have already mentioned, it is easy to understand 
why dark matter halo centres may be preferential places for 
ongoing growth of black holes. These are the regions where 
gas would be expected to cool and settle through radiative 
processes. In addition, dynamical friction will erode the or- 
bits of satellite galaxies within a dark matter halo until they 
sink to the middle and merge with the central object. Both 
these processes may bring fresh gas to the central galaxy and 
fuel episodes of nuclear activity and black hole growth. As 
we have seen, however, the evidence for an excess number 
of close neighbours around AGN is rather weak, perhaps 
because in most cases the offending satellite has already 
been swallowed. We also note that even the most power- 
ful AGN in our sample are less luminous than the quasars 
with M(i) < -24 for which ISerber et al.l J200fj) detected an 
excess number of companions on small scales. What about 
the evidence for cooling? 

Direct observational evidence for cooling from hot X- 
ray emitting gas at the center s of d ark matter halos has 
also been elusive. iBenson et all J200(J) used ROSAT PSPC 
data to seach for extended X-ray emission from the halos of 
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three nearby, massive spiral galaxies. Their 95 percent up- 
per limits on the bolometric X-ray luminosities of the halos 
show that the present day accretion from any hot virial- 
ized gas surrounding the galaxies is very small. Recently 
iPedersen et alJ i2006t) detected a gaseous halo aroung the 
quiescent spiral NGC 5746 using Chandra observations, but 
this remains the only spiral galaxy with evidence for on- 
going accretion from an extended reservoir of hot gas. In 
clusters, X-ray spectroscopy has shown that m ost of the gas 
gas does not manage to co ol below 10 7 K (e.g. iDavid et alJ 
l200ll:lPeterson et alJlioOlh . 

Gas accretion in the form of cold Hi-emitting clouds 
is, however, much less well-constrained. In recent work, 
Kauf frnann et all i2006T) studied a volume-limited sample of 
bulge-dominated galaxies with data both from the Sloan 
Digital Sky Survey and from the Galaxy Evolution Ex- 
plorer (GALEX) satellite. Almost all galaxies with bluer- 
than-average NUV-r colours were found to be AGN. By an- 
alyzing GALEX images, these authors demonstrated that 
the excess UV light is nearly always associated with an ex- 
tended disk. They then went on to study the relation be- 
tween the UV-bright outer disk and the nuclear activity in 
these galaxies. The data indicate that the presence of the 
UV-bright disk is a necessary but not sufficient condition 
for strong AGN activity in a galaxy. They suggest that the 
disk provides a reservoir of fuel for the black hole. From time 
to time, some event transports gas to the nucleus, thereby 
triggering the observed AGN activity. 

The GALEX results indicate that the extended disks 
of galaxies play an important role in the fuelling of AGN. 
The clustering results from the SDSS indicate that AGN 
are preferentially located at the centres of dark matter 
halos. In theoretical models, rotationally supported disks 
are expected to form at t he centers of dark matter halos 
JMo. Mao, fc Whitdll99g|) . After the galaxy is accreted by 
more massive halos and becomes a satellite system, the disks 
may lose their gas via process es such as ram-pressure strip- 
ping fe.g. lCavatte et al]ll994l) . Disks located at halo centres 
are likely likely to survive for longer periods. Dynamical per- 
turbations driven by the dark mattter near the centres of the 
halos may res ult in gas inflows an d fuelling of the central 
black hole fsee lOao fc Whitell2006l . for a recent discussion). 
Further progress in understanding the AGN phenomenon in 
the local Universe will require detailed modelling of the ob- 
servable components of galaxies within evolving dark matter 
halos, as well as further investigation of the connection be- 
tween AGN activity and phenomena such as bars, warps, 
lopsided images, and asymmetric rotation curves. 
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